No Bullshit Guide to Statistics prerelease
minireference.comยท1h
๐Ÿ“ฐContent Curation
Assuring Agent Safety Evaluations By Analysing Transcripts
lesswrong.comยท9h
๐Ÿ†LLM Benchmarking
Evaluating Gemini 2.5 Deep Think's math capabilities
epoch.aiยท5hยท
Discuss: Hacker News
๐Ÿ†LLM Benchmarking
How leaderboards lost their spot as the best way to judge AI
platformer.newsยท18h
๐Ÿ†•New AI
The Future of AI is Verifiable Thought
pub.towardsai.netยท1h
๐ŸŽญClaude
Regression to the Mean
blog.engora.comยท4hยท
Discuss: Hacker News
๐ŸŽฏVector Quantization
How well can large language models predict the future?
forecastingresearch.substack.comยท23hยท
Discuss: Substack
๐Ÿ†LLM Benchmarking
Linear Risk Sharing on Networks
freakonometrics.hypotheses.orgยท20h
๐ŸŒDistributed systems
How different AI engines generate and cite answers
searchengineland.comยท7h
๐Ÿ“ŠFeed Optimization
Statistics in the Era of AI
scienceforeveryone.scienceยท21hยท
Discuss: Hacker News
๐Ÿ‘จโ€๐Ÿ’ปAI Coding
Show HN: Comparegpt.io โ€“ Trustworthy Mode to reduce LLM hallucinations
news.ycombinator.comยท18hยท
Discuss: Hacker News
๐Ÿ†LLM Benchmarking
MeteoSaver LLM based software for the transcription of historical weather data
egusphere.copernicus.orgยท11hยท
Discuss: Hacker News
๐ŸŒClimate
๐ŸŽฒ 6+ Favorite Books on Human Geography (So Far!)
nateshivar.comยท12h
๐Ÿ›๏ธPolitics
'Battleship'-style math can improve sustainable design, groundwater management, nuclear waste storage and more
phys.orgยท3h
โš›๏ธPhysics
Hackathon Winners: Plugins Designed for DevOps
usetrmnl.comยท21h
๐Ÿ”งDeveloper Tools
You don't avoid the chaos. You filter it.
threadreaderapp.comยท2h
๐ŸงนSpam Filters
Employers Use AI To Screen Resumes, So Applicants Use AI To Game It
thelowdownblog.comยท4hยท
๐ŸงนSpam Filters
Aeva: High Risk, Possibly High Reward Leads To A Hold For Now
seekingalpha.comยท24m
๐Ÿš€Startups
Men Are Betting on WNBA Players' Menstrual Cycles
wired.comยท8hยท
Discuss: r/TrueReddit
๐Ÿ”ƒFeed Algorithms